Alcohol-use disorders (AUD) and cannabis-use disorders (CUD) are prevalent substance-use disorders with significant implications for public health and individual well-being. This paper aims to investigate the influence of societal, economic, and governmental factors on the rates of AUD and CUD among different countries. The question guiding this research is:
To address this question, three broad categories of variables were considered. These categories were legislative/policy differences, economic differences, and social differences. Each category can then be broken up into the specific variables that were taken into consideration. Legislative/policy differences encompass government type, legal age to consume the substance, availability of treatment for substance-use disorders, and civil liberties. It was hypothesized that countries with more freedoms and fewer regulations pertaining to substance use will exhibit higher rates of AUD and CUD. Economic differences can be broken down into gross domestic product (GDP) per capita, poverty rates, and unemployment rates. It was hypothesized that as a country’s economic strength increases, so does the prevalence of AUD and CUD. Social differences can be broken down into major religions and the prevalence of specific types of alcohol. It was hypothesized that different religious affiliations and alcohol types would influence the rates of AUD and CUD. This research utilizes various datasets from different sources, which were combined into one dataset for analysis. This paper aims to contribute to the understanding of the different factors influencing the rates of AUD and CUD in different countries.
Keywords: alcohol-use, substance-use, cannabis, disorder, AUD, CUD
Alcohol-use disorders (AUD) and cannabis-use disorders (CUD) are two prevalent and clinically significant substance-use disorders (SUD) that have garnered considerable attention due to their impact on public health and individual well-being. Numerous studies have been published over time on the long-term effects of substance use and how they negatively affect health outcomes. Some of these negative health outcomes include developing high blood pressure, heart diseases, cancers, weakening of the immune system, and many more [3]. With so many negative health outcomes associated with different kinds of substance-use disorders, it is important to examine what factors increase or decrease the rates of AUDs and CUDs. In doing this, we hope to examine why certain countries may have higher or lower rates of substance use disorders than others, and what specific differences between countries may be attributed to these discrepancies. Our research question is: How do Societal Factors and Cross-Cultural differences affect the rates of Alcohol-Use Disorder and Cannabis-Use Disorder in different nations?
These prevalent substance-use disorders have garnered considerable attention due to their negative health outcomes, including high blood pressure, heart diseases, cancers, and weakened immune systems. Throughout this presentation, we will explore the factors that contribute to varying rates of AUDs and CUDs among countries, shedding light on the specific differences underlying these discrepancies.”
In only 2019, the NIH reported that alcohol-related deaths accounted for 2.07 million male deaths and 374 thousand female deaths globally [17].
In 2020 (the first year of the COVID pandemic) alcohol sales increased by 2.9%, which is the largest annual increase in the last 50 years [18]. Although this presentation does not include COVID era data, it still gives reliable indicators as to AUD trends.
2019 studies from the CDC estimate 3/10 people who regularly use cannabis have CUD [19].
As described by the most recent Diagnostic and Statistical Manual (DSM-V), which is what psychologists and doctors often refer to when determining whether someone fits the criteria of having a specific mental disorder [2]. The DSM-V describes 11 specific statements that are used as a guide to whether someone can be identified as someone with an AUD or a CUD, where agreement with 2 of the statements marks the presence of the disorder [2]. The prompts include statements like:
With a solid understanding of what makes up a substance-use disorder, we can begin to theorize potential differences among nations that may affect rates of SUDs. For the purposes of this project, we have divided our variables into three broad categories.
Legislative/Policy differences refer to factors that are controlled by the government of a nation. This includes the types of governments a nation associates with, specific policies about certain substances, availability of treatment for SUDs, and the level of freedom individuals in a nation’s population feel. We hypothesize that the more freedoms people have, the more they will indulge in alcohol use and cannabis use. Therefore, countries whose citizens have more freedoms and fewer regulations against the development of SUD will develop AUDs and CUDs at higher rates.
Economic differences refer to factors that can be used to define the financial stability of a country. These include gross domestic product (GDP) per capita, the ratio of population under the international poverty line ($2.15), and the ratio of people who are unemployed, but looking for/able to work. We hypothesize that as the economic strength of a nation increases, the rate of AUD/CUD will also increase. This is for a similar reason as the last hypothesis in that if people are given the ability to, and are financially able to drink or smoke, they will. Therefore, we hypothesize that countries with higher GDPs and lower rates of poverty and unemployment will have higher rates of AUDs/CUDs.
Social Differences refer to factors that are culturally significant and withheld by the people of a population. Examples of these include national religions that a country primarily associates with and the prevalence of specific kinds of alcohol (beer, wine, and spirits). We hypothesize that there will be a difference in AUD/CUD rates by different religions and alcohol prevalence.
The following section includes basic information about their collection, calculations, and manipulations as they are sized into our dataset. Many of the datasets gathered for his research were from different sources, so we thought it was important to distinguish how we obtained this data and how reliable it is in explaining its specific concentration.
The Global Health Data (GHD) is self-proclaimed as the world’s most comprehensive catalog of surveys, censuses, vital statistics, and other health-related data. We ran across this dataset looking for data about substance-use disorders. This data is pulled together by the coordination of two separate independent companies: The Global Burden of Disease and the Institute for Health Metrics and Evaluation. The Global Burden of Disease (GBD) is a global research program that provides quantifiable information about health issues regarding hundreds of different diseases, injuries, and other risk factors that may influence overall health [7]. The GBD collects data on different sexes and years from more than 9,000 researchers in more than 162 countries and territories about health issues. The Institute for Health Metrics and Evaluation (IHME) is an independent global health research center run by the University of Washington.[1] The IHME works with over 8,000 scientists, government officials, and medical professionals to provide accurate data on many diseases and other risk factors for mortality [10]. Along with that, the IHME uses surveys and questionnaires about the use and prevalence of drugs and medical treatments, and autopsy experiences, along with biometric data about each household. We pulled a series of variables from this dataset, including different measures, locations, years, sexes, and substance-use disorders. For the measures, we looked at deaths and incidences as a rate, meaning that it would look at both deaths and incidences per one hundred thousand people in a specific country. Specifically, we looked at rates of alcohol-use disorder (AUD) and cannabis-use disorder (CUD) from 1999-2019 among different sexes in different countries. For the most part, we primarily looked at the years 1999, 2009, 2019 as they were good indicators of the change that occurred over time.
Natural Earth is a public data website that keeps data about different countries and land masses, including name, type, and geography that allow for a computer-generated image of the country. The data for this could be installed as a CSV and loaded into RStudio. From this dataset, we only used data about the international recognition codes and the geographies, which we just selected and left_joined to our original dataset.
OWID also provides a dataset for Civil Liberties, which comes from Freedom House, an organization that rates and scores countries based on their relative liberties for citizens. Similar to V-Dem, Freedom House chooses a team of advisors and researchers to study each country or territory and report on their governing structure, rule of law, personal freedoms, etc [21]. These reports are reviewed by “expert advisors and regional specialists” before being published in the data. The dataset for each country’s civil liberties rating contains a discrete ranking from 1 to 7, with 1 being the freest and 7 the least free. Again, in the same fashion as with the RoW data, we factored this variable into smaller quantities due to the relatively small sample sizes for certain liberties ratings:
1-2 “high liberties”
3-5 “moderate liberties”
6-7 “low liberties.”
This ensures each factor has a large enough sample size for us to draw meaningful conclusions from the analysis.
Our World in Data (OWiD) led us to a dataset on the Regimes of the World (RoW), published by the Varieties of Democracy (V-Dem) project from the University of Gothenburg. Every year, the project selects 5 “Country Experts” from a larger list to complete a carefully crafted survey on various themes in the political structure, human freedoms, etc. The team at V-Dem meticulously researches and studies their list of Country Experts each year to see which experts know the most in which fields, using five criteria (expertise, connection, purpose, impartiality, diversity) to score the experts from 1 to 3, where 1 is a high priority recruitment. Then, the team starts recruiting, reaching out to experts from their ranked list until they have 5 from every country, who proceed to complete the survey [22]. Notably, this data is observational and not randomly sampled: the V-Dem team spends much of their resources selecting the Country Experts under the assumption that they are in a better position to accurately assess the state of their country in regard to the survey topics. Out of this huge V-Dem dataset, the RoW data breaks each country’s regime into one of four factors: closed autocracy, electoral autocracy, electoral democracy, and liberal democracy. This four-factor variable measuring the political structure and citizen involvement in executive leadership is the one we use in our analysis of alcohol use disorders.
WHO performed a Global Survey on Alcohol and Health in 2016, which surveyed 194 nations and territories to find the minimum age at which alcohol can legally be purchased off-premises there [16]. This is a different measure from the minimum legal age at which someone can be served alcohol on-premises (although in most nations, these are the same age). The dataset includes these measures for beer, wine, and spirits. However, we elected only to use the drinking age for off-premises sales of beer; the vast majority of nationalities in the dataset use the same age for all three types of alcohol, and including multiple variables in our analysis containing essentially the same information would be unhelpful. Using this data in its raw format, we had a factored categorical variable with 12 levels (representing different drinking ages) from Total ban to None (no minimum age for alcohol purchase). The data wasn’t usable in this form, because many of those 12 factors only contained a single (or occasionally a couple) observation. For this reason, we condensed this variable into a four-level factor: Total Ban, Over 18, 18 and Under, None. In this formulation, each level has over 10 observations, allowing for more reasonable use in regression modeling. Notably, the 18 and Under group is by far the largest, as the great majority of countries have legal drinking ages of 18.
Similar to the other datasets within the OWID website, data about the treatment percentile of the population was gathered. That is, what percent of the population who has an AUD is getting treatment for their disorder [12]. This data is based on estimates and was gathered by OWID from the World Health Organization (WHO). This data was gathered by WHO through national authorities of specific countries through the WHO Global Survey on Alcohol and Health [12]. Because of the specificity of this data, it was only gathered in 2008, which is why it is challenging to make inferences on the importance of this variable. Unlike the other datasets by OWID, there was not a lot of tidying of the data to be done. This one we could simply select the specific variables we wanted and just left_joined it from there, good to go!
The World Bank Organization is an institution that provides financial support to developing countries. The World Bank Group (WBG) is funded by different member countries of the United Nations with the primary goals of resolving extreme poverty and promoting prosperity among nations [14]. In order for the WBG to properly allocate funds to different developing countries, they have to collect data on different aspects of a specific country’s economy, including poverty rate, homelessness rates, and gross domestic product (GDP). These bits of information are gathered from World Bank national account data and OECD National Accounts data files. GDP is a commonly used measure of accurately quantifying a country’s economic stability. The GDP measure we chose to use was the sum of gross value added by all the resident producers in the economy, plus any property tax and minus any subsidies not included in the value of the products [9]. With this value representing the GDP of a specific nation, we divide that by the midyear population to get the GDP per capita. This GDP per capita variable was stored in the currency of US dollars. It is also important to note how these values were calculated without making deductions for depreciation and degradation of natural resources [9]. The dataset is very full with little to no NA values. The original dataset was formatted in a way that each year was its own column, and each country was its own row. To fix this to match the format of our original dataset, we used a pivot_longer that gave every country the same number of rows as there were years of data for them. From there, it was a simple left_join to combine those datasets.
Similarly, to the previous dataset, the WBG supplied us with a dataset about poverty rates in different nations over time. Data about poverty was gathered by primary household survey data from government statistical agencies and World Bank country departments [11]. Since one of the main goals of the WBG is resolving and understanding poverty, you would expect them to have comprehensive data on this issue. However, with such a complex issue as poverty, it is hard to accurately depict poverty among different countries over time. Thus, this data does have many NA values for a lot of the countries over time, but it is still a relevant indicator of economic stability within a country that could provide to be important when compared to rates of AUD and CUD. This poverty data represents the ratio of the population (in each specific country and year) that would be under the international poverty line of $2.15. Because of the changes in the purchasing power parity (PPP) exchange rate, we cannot compare the poverty rates of different years to each other. The original data was formatted in a way similar to the data above, which saw every year as its own column and each country as its own row. To fix this to work with our primary dataset, we used a pivot_longer to make the years into rows. Along with that, we cut the year range down to only years after 1999, since that was the earliest year our original dataset went to.
Following the same trend, this dataset was originally found on the WBG website. Originally, this data was gathered by the International Labour Organization (ILO), a specialized agency that works under the surveillance of the UN to help improve labor statistics. The ILO gathers labor data through a series of methods, including official estimates, administrative sources, household surveys, national accounts, and more [4]. The unemployment data measures the total share of the labor force without work but is available/seeking work [13]. One limitation of this data is that the survey data does not fully consider the complexities of some areas of work. For instance, individuals working in agriculture may make the data more skewed during certain seasons depending on what they are growing. Regardless, we believed this data would be useful, in conjunction with the GDP and poverty data, to analyze the economic status within a country and how that affects the rates of AUD/CUD. This data, like the previous two, had to be reformatted using the pivot_longer and filter functions to only include years after 1999.
In seeking to create a model for predicting rates of AUD by country, we began with an initial analysis of our individual variables. Many of our variables are categorical, factored, variables, which makes it important to understand the relationships between groups within each variable before trying to look at interactions between variables. Much of the analysis was completed by using ANOVAs and Pearsons correlation coefficient to determine the relationships between the variables and AUD/CUD rates.
| Year | Low-Moderate | Moderate-High | Low-High |
|---|---|---|---|
| 1999 | 114.3 | -205.8** | -320.1** |
| 2009 | 112.2 | -261.0*** | -373.2*** |
| 2019 | 33.8 | -254.2*** | -288.0** |
N=187. p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
| Year | Low-Moderate | Moderate-High | Low-High |
|---|---|---|---|
| 1999 | 6.5 | -42.6*** | -49.1*** |
| 2009 | 6.2 | -37.5*** | -43.7*** |
| 2019 | 5.8 | -36.1*** | -42.1*** |
N=187. p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
| Comparison | Closed Autocracy ~ Electoral Autocracy | Closed Autocracy ~ Electoral Democracy | Closed Autocracy ~ Liberal Democracy |
|---|---|---|---|
| p-value | 0.0406* | 0.0039** | 0.0000*** |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
| Year | Electoral Autocracy ~ Closed Autocracy | Electoral Democracy ~ Closed Autocracy | Liberal Democracy ~ Closed Autocracy |
|---|---|---|---|
| 1999 | 80.5 | 212.3 | 474.9*** |
| 2009 | 147.3 | 279.9* | 482.5*** |
| 2019 | 260.3* | 344.0** | 504.4*** |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
A similar trend was seen in cannabis as seen in alcohol, as citizens of Liberal Democracy had significantly higher rates of CUDs when compared to all other types of government styles. Negative numbers represent the latter regime of each cell having that number more cases while positive numbers represent the former having more cases.
| Year | Liberal Democracy ~ Electoral Autocracy | Liberal Democracy ~ Electoral Democracy | Liberal Democracy ~ Closed Autocracy |
|---|---|---|---|
| 1999 | 51.8*** | 39.4*** | 51.3*** |
| 2009 | 45.7*** | 36.2*** | 45.3*** |
| 2019 | 46.3*** | 41.3*** | 49.7*** |
N=167. p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
| Comparison | Under 18 ~ 18 | 18 ~ Over 18 | Under 18 ~ Over 18 |
|---|---|---|---|
| p-value | 0.8862 | 0.5296 | 0.992 |
Due to the high p-values on all the central ranges with age, we combined all three factors into one: Age Specific. This factor contains all countries except those with None and Total ban for drinking age. Despite a high p-value of 0.9561 indicating no significant difference between None and Total ban, we did not collapse these, because they are so fundamentally different as categories. The new Age Specific factor differs significantly from both other factors, shown by the Tukey’s test:
| Comparison | None ~ Age Specific | Age Specific ~ Total ban | None ~ Total ban |
|---|---|---|---|
| p-value | 0.0029 | 0.0002 | 0.7647 |
Since this applies to Alcohol-Use Disorder Treatment rates, we will not be analyzing Cannabis here. Since there was only data regarding the year 2008 in this dataset, we only wanted to compare the rates of AUD in that specific year to see if there was a correlation between them. With such little amounts of data in this area, after running the correlation test, we found that AUD treatment rate was weakly, negatively correlated with AUD rate per 100k, with a correlation coefficient of -0.218 (p = 0.2).
In comparing these quantitative, non-discrete variables, we used a Pearson’s product-moment correlation test that would determine the correlation of the data between two variables. A number farther away from 0 represented the fact that there was a stronger correlation between the two variables.
| Year | Degrees of Freedom | Correlation Coefficient | Overall Relationship |
|---|---|---|---|
| 1999 | 181 | 0.178* | Very Weak Positive (↑GDP, ↑AUD Rate) |
| 2009 | 192 | 0.210** | Weak Positive (↑GDP, ↑AUD Rate) |
| 2019 | 190 | 0.231** | Weak Positive (↑GDP, ↑AUD Rate) |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
| Year | Degrees of Freedom | Correlation Coefficient | Overall Relationship |
|---|---|---|---|
| 1999 | 181 | 0.479*** | Moderate Positive (↑ GDP, ↑CUD Rate) |
| 2009 | 192 | 0.448*** | Moderate Positive (↑ GDP, ↑ CUD Rate) |
| 2019 | 190 | 0.461*** | Moderate Positive (↑ GDP, ↑ CUD Rate) |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
| Year | Degrees of Freedom | Correlation Coefficient | Overall Relationship |
|---|---|---|---|
| 1999 | 33 | -0.218 | Weak Negative (↑ Poverty Proportion, ↓ AUD Rate) |
| 2009 | 75 | -0.373*** | Weak Negative (↑ Poverty Proportion, ↓ AUD Rate) |
| 2019 | 66 | -0.154 | Very Weak Negative (↑ Poverty Proportion, ↓ AUD Rate) |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
| Year | Degrees of Freedom | Correlation Coefficient | Overall Relationship |
|---|---|---|---|
| 1999 | 33 | -0.450** | Moderate Negative (↑ Poverty Proportion, ↓ AUD Rate) |
| 2009 | 75 | -0.312** | Weak Negative (↑ Poverty Proportion, ↓ AUD Rate) |
| 2019 | 66 | -0.172 | Very Weak Negative (↑ Poverty Proportion, ↓ AUD Rate) |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
| Year | Degrees of Freedom | Correlation Coefficient | Overall Relationship |
|---|---|---|---|
| 1999 | 178 | 0.239* | Weak Positive (↑ Unemployment, ↑ AUD Rate) |
| 2009 | 178 | 0.184* | Weak Positive (↑ Unemployment, ↑ AUD Rate) |
| 2019 | 178 | 0.02 | Very Weak Positive ((↑ Unemployment, ↑ AUD Rate)) |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
| Year | Degrees of Freedom | Correlation Coefficient | Overall Relationship |
|---|---|---|---|
| 1999 | 178 | -0.001 | Very Weak Negative (↓ Unemployment, ↑ CUD Rate) |
| 2009 | 178 | 0.020 | Very Weak Positive (↑ Unemployment, ↑ CUD Rate) |
| 2019 | 178 | -0.082 | Very Weak Negative (↓ Unemployment, ↑ CUD Rate) |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
With each variable individually analyzed, they were one-hot encoded into a multiple regression model for predicting AUD incidence rates in 2019. A correlation test was used to determine which factors across variables might be interrelated. Below are the strongest inter-variable correlations.
| Factors | Correlation Coefficient |
|---|---|
| GDP Per Capita- Wine Consumption | 0.61 |
| GDP Per Capita- Liberal Democracy | 0.72 |
| High Liberties- Liberal Democracy | 0.71 |
Using a combination of manual stepwise regression and correlation comparisons, a final model for predicting AUD incidences was found at 90% confidence, with a maximum adjusted R-square of 0.3348.
| Variable | Coefficient |
|---|---|
| Intercept | 475.44*** |
| Beer Consumption | 60.68* |
| Spirit Consumption | 49.04 |
| Wine Consumption | 58.14* |
| Islam | -142.15 |
| None | -198.18 |
| Electoral | 145.79* |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
Checking the model residuals for normality, the Q-Q Plot shows the residuals are nearly normal.
## -----------------------------------------------
## Test Statistic pvalue
## -----------------------------------------------
## Shapiro-Wilk 0.973 0.0054
## Kolmogorov-Smirnov 0.079 0.3177
## Cramer-von Mises 18.956 0.0000
## Anderson-Darling 1.1193 0.0061
## -----------------------------------------------
| Variable | Coefficient |
|---|---|
| Intercept | 6.06933*** |
| Beer Consumption | 0.10242** |
| Spirit Consumption | 0.07164 |
| Wine Consumption | 0.0807* |
| Islam | -0.39931*** |
| Electoral | 0.1762 |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
Testing for normality with Kolmogorov-Smirnov, we find a test statistic of 0.076 and p-value 0.3635, concluding we do not have evidence to conclude non-normality on 90% confidence. The quantile plot remains imperfect but is improved from the non-transformed model. Additionally, Cook’s Distance reveals there are no influential points of concern.
Next, we repeated this process for the AUD deaths in 2019, finding a model with maximum adjusted R-square 0.1672 and quickly realizing only one predictor was significant.
| Variable | Coefficient |
|---|---|
| Intercept | 1.1598** |
| Spirit Consumption | 1.0614*** |
## -----------------------------------------------
## Test Statistic pvalue
## -----------------------------------------------
## Shapiro-Wilk 0.7628 0.0000
## Kolmogorov-Smirnov 0.2395 0.0000
## Cramer-von Mises 19.3985 0.0000
## Anderson-Darling 10.5084 0.0000
## -----------------------------------------------
## -----------------------------------------------
## Test Statistic pvalue
## -----------------------------------------------
## Shapiro-Wilk 0.7538 0.0000
## Kolmogorov-Smirnov 0.2335 0.0000
## Cramer-von Mises 21.0804 0.0000
## Anderson-Darling 11.1195 0.0000
## -----------------------------------------------
## -----------------------------------------------
## Test Statistic pvalue
## -----------------------------------------------
## Shapiro-Wilk 0.9909 0.4647
## Kolmogorov-Smirnov 0.0429 0.9506
## Cramer-von Mises 11.1608 0.0000
## Anderson-Darling 0.2968 0.5872
## -----------------------------------------------
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
In the Kolmogorov-Smirnov test with 95% confidence, we find a test statistic of 0.2395 and p-value 0.0000, therefore rejecting the null hypothesis and concluding that the residuals are not normal.
Analyzing the residuals for outliers, there are three observations of concern: #12, #42, and #91. These correspond to the countries of El Salvador, Belarus, and Mongolia, respectively. Using Cook’s Distance to check for influential points, #12 sticks out.
The mean rate of AUD deaths globally for 2019 was 2.59 per 100,000 people. Belarus (#12) had a rate of 21.35. El Salvador (#42) had a rate of 14.76 and Mongolia (91) a rate of 15.77. Removing Belarus from the data does not improve the normality of the residuals : Kolmogorov-Smirnov reveals a test statistic of 0.209 and a p-value of 0.0000 on 95% confidence. However, there are no influential points for the new model fit without Belarus.
Performing a logarithmic transformation on our model for deaths by transforming the vertical axis, we refit the 95% confidence multiple regression with an adjusted R-square of 0.2232.
| Variable | Coefficient |
|---|---|
| Intercept | 0.07243 |
| Spirit Consumption | 0.13838* |
| Beer Consumption | 0.11883* |
| Islam | -0.49179* |
p < 0.10, *p < 0.05, **p < 0.01, ***p < 0.001.
In the transformed model, the Kolmogorov-Smirnov test has a test-statistic of 0.0429 and a p-value of 0.9506 on 95% confidence, suggesting there is no evidence to conclude the residuals are non-normal. The quantile and Cook’s Distance plots affirm this.
The purpose of this study was to investigate the impact of societal, economic, and governmental factors on the rates of alcohol-use disorder and cannabis-use disorder in different countries. The research focused on legislative/policy differences, economic differences, and societal differences.
Legislative/policy differences refer to factors that are controlled by the government of a country, such as specific policies about substances, availability of treatment for substance-use disorders, and the level of freedom individuals have in their country. The hypothesis was countries with more freedoms and fewer regulations would show higher rates of substance-use disorders. The analysis supported this hypothesis, as countries with higher ratings of civil liberties also had higher rates of AUDs. In 1999, countries with higher ratings of civil liberties had 205.8 more cases per 100k than countries with moderate liberties (p<0.01) and 320.1 more cases per 100k than countries with low civil liberties ratings (p<0.01). This trend is consistent across the years 2009 and 2019 as well. In 2009, countries with higher ratings of civil liberties had 261.0 more cases per 100k than countries with moderate liberties (p<0.001) and 373.2 more cases per 100k than countries with low civil liberties ratings (p<0.001). In 2019, countries with higher ratings of civil liberties had 254.2 more cases per 100k than countries with moderate liberties (p<0.001) and 288.0 more cases per 100k than countries with low civil liberties ratings (p<0.01). There were no statistically significant results for any of the three years when comparing low civil liberties to moderate civil liberties. This trend is consistent with cannabis-use disorders as well. There are no statistically significant results among low civil liberties and moderate civil liberties among any of the years analyzed. However, in 1999 there were 42.6 more cases per 100k in countries with high civil liberties than in countries with moderate civil liberties (p<0.001) and 49.1 more cases per 100k in countries with high civil liberties than in countries with low civil liberties (p<0.001). In 2009 there were 37.5 more cases per 100k in countries with high civil liberties than in countries with moderate civil liberties (p<0.001) and 43.7 more cases per 100k in countries with high civil liberties than in countries with low civil liberties (p<0.001). In 2019 there were 36.1 more cases per 100k in countries with high civil liberties than in countries with moderate civil liberties (p<0.001) and 42.1 more cases per 100k in countries with high civil liberties than in countries with low civil liberties (p<0.001). These results indicate a trend that higher civil liberties (being freer) is correlated with higher rates of substance-use disorders.
When looking at the regime type of each country in comparison to alcohol-use disorders, the results indicate a statistically significant difference between countries with a closed-autocracy compared to an electoral democracy (p=0.0039) and liberal democracy (p=0.0000), but not compared to an electoral autocracy (p=0.0406). Across the years, closed autocracy countries had 260.3 less people per 100k with an alcohol-use disorder than electoral autocracy countries (p<0.05), 344.0 less people with an alcohol-use disorder in closed autocracy countries than in electoral democracy countries (p<0.01), and 504.4 less people with an alcohol-use disorder than in liberal democracy countries (p<0.001). In 2009 there were 279.9 less people with an alcohol-use disorder in closed autocracy countries than in electoral democracy countries (p<0.05) and 482.5 less people with an alcohol-use disorder than in countries with a liberal democracy (p<0.001). In 1999 there were 474.9 less people with an alcohol-use disorder in closed autocracy countries than in countries with a liberal democracy (p<0.001). These results indicate that countries with less strict regimes (liberal democracy) have higher rates of alcohol-use disorders than in countries with more strict regimes (closed autocracy). Countries with a closed autocracy show the lowest rates of alcohol-use disorders, with electoral democracy the next highest amount, followed by electoral democracy, and liberal democracy being the highest amounts of alcohol-use disorders. This same trend is shown with cannabis-use disorders as well. For this comparison, we used liberal democracy as the baseline to compare the other regimes against. Across all comparisons for every year, the results were statistically significant at p<0.001. In 1999 liberal democracies showed 51.8 more people with a cannabis-use disorder than electoral autocracies, 39.4 more cases per 100k than electoral democracies, and 51.3 more cases per 100k than closed autocracies. In 2019, liberal democracies showed 45.7 more cases per 100k than electoral democracies, 36.2 more cases per 100k than electoral democracies, and 45.3 more cases per 100k than closed autocracies. In 2019, liberal democracies showed 46.3 more cases per 100k than electoral autocracies, 41.3 more cases per 100k than electoral democracies, and 49.7 more cases per 100k than closed autocracies. The same trends are seen among cannabis-use disorders as alcohol-use disorders, with closed and electoral autocracies overlapping more with cannabis-use disorders.
The legal drinking age for a country refers to the minimum age it is legal to consume alcohol. We split these ages into categories of under 18 years old, 18 years old, over 18 years old, no limit, and total ban. There were no statistically significant results when comparing the under 18 category to the 18 years old category, the 18 years old category to the over 18 category, the under 18 category to over 18 category, or the no ban category to the total ban category. We collapsed under 18, 18, and over 18 into one category titled ‘age specific.’ However, there were statistically significant results when comparing no limit to an age specific limit (p=0.0029), and an age specific limit to a total ban (p = 0.0002). These results indicate that a country having a total ban or no ban at all are significant predictors of alcohol-use disorders in comparison to an age specific limit. Countries with a total ban on average have significantly less alcohol-use disorders than countries with an age limit, and countries with no ban are also lower, but slightly higher than countries with a total ban. What this data tells us, across each variable within legislative factors, is that the less strict a country’s government is, the higher chance their citizens have of developing a substance-use disorder.
In 1999 there was a very weak, positive correlation between GDP and alcohol-use disorders per country. Therefore, the higher the GDP per capita, the higher the rates of alcohol-use disorder (r=0.178, p<0.05). In 2009 and 2019 this trend grew more correlated and more significant, becoming a weak positive correlation for both years (r=0.210, p<0.01 in 2009 and r=0.231, p<0.01 in 2019). These correlations are continued for cannabis-use disorder; however they are stronger and more significant than in alcohol-use disorder. In 1999, GDP per capita and CUD showed a correlation coefficient of 0.479 (p<0.001). In 2009 this correlation was 0.448 (p<0.001) and in 2019 the correlation coefficient was 0.461 (p<0.001). These all indicate a moderate, positive correlation between the two variables.
Among the correlations between poverty and alcohol use disorder, only the year 2009 was significant, with a correlation coefficient of -0.373 and a p-value of <0.001. This indicates a weak, negative correlation, showing that as the proportion of a country’s population in poverty increases, rates of alcohol-use disorders decrease. Among the data for cannabis-use disorders, 1999 and 2009 both showed significant correlations, r=-0.450 (p<0.01) in 1999 and r=-0.312 (p<0.01) for 2009. These numbers indicate a moderate negative correlation between poverty proportion and CUD in 1999 and a weak negative correlation between poverty proportion and CUD in 2009.
Unemployment rates and AUD were correlated with a p-value less than 0.05 in 1999 and 2009. In 1999 the correlation coefficient between the two variables was 0.239, and 0.184 in 2009. These both indicate weak positive correlations between unemployment and AUD, indicating that as unemployment rates increase, so do rates of alcohol-use disorder. These trends are not seen among cannabis-use disorder and unemployment rates, as those relationships among the three years are not statistically significant. What this data tells us, among GDPs per capita, poverty, and unemployment rate is that the more financially strong a country is, the higher rates of alcohol and cannabis-use disorders that country experiences. This is likely due to there being more stable access to the substances for individuals in that country.
Following all the previous analyses, we ran a multiple regression model across all the variables in order to determine which variables had the most correlation. The highest correlation coefficient was between GDPs per capita and liberal democracy countries, which was r = 0.72. This indicates a strong positive correlation. What this tells us is that on average, countries with a liberal democracy tend to be more economically strong.
The next highest correlation coefficient was between high liberties (civil liberties) and liberal democracy (regime). This coefficient came out to r=0.71, indicating a strong positive correlation between the two. This makes sense considering many of the countries with high civil liberties ratings are countries with a liberal democracy (United States of America, Spain, Ireland, etc.).
The third strongest correlation coefficient was between GDPs per capita and wine consumption, with a coefficient of r = 0.61. This indicates a moderate positive correlation. This tells us that has GDP per capita increases, so does wine consumption. Further research would be needed to understand why, but one could infer that wine is typically associated with class and sophistication, and therefore consumption would increase as people have more money.
A final model was created to predict incidences of AUD. This model shows predictions at a confidence interval of 90%, with a maximum adjusted R-square of 0.3348.
The limitations of this research as we conducted is causality, data limitations, and ecological fallacy. Most of our data we can only draw correlational information from, and it may be difficult to say one or more variables directly caused AUD or CUD rates to increase or decrease. With that, we did not run any analyses to check for interaction effects, so we are unable to conclude if any interactions between variables had a significant impact on AUD and CUD rates. While we were able to find and analyze quite a bit of data from varying datasets, there were also some variables we wanted to look at but just couldn’t find good data or enough data to include in our research. A third limitation of this research is ecological fallacy. A lot of our data describes countries as a whole rather than looking at smaller units of those countries and the differences that may exist within them. This is evident in the religion data, as we could only compare the major religion of each country, which doesn’t always accurately portray the whole country or even a majority of the country. For example, a country may have 3 major religions, 35% one religion, 33% another religion, and 32% a third religion, and the country would be labeled as the first religion, as it is the highest number. This, however, ignores 65% of the country’s religious affiliation. Another limitation of our research is the timeframe. Our main dataset only has data through 2019, which was just before the Covid pandemic really hit. It would be interesting to find more current data to see if and how Covid-19 has changed rates of alcohol and cannabis use disorders.
[1] (“About GBD | Institute for Health Metrics and Evaluation”, n.d.) About GBD. Institute for Health Metrics and Evaluation. (2023, February 10). https://www.healthdata.org/gbd/about
[2] American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (5th ed.). https://doi.org/10.1176/appi.books.9780890425596
[3] Centers for Disease Control and Prevention. (2022, April 14). Drinking too much alcohol can harm your health. learn the facts. Centers for Disease Control and Prevention. https://www.cdc.gov/alcohol/fact-sheets/alcohol-use.htm#:~:text=Over%20time%2C%20excessive%20alcohol%20use,liver%20disease%2C%20and%20digestive%20problems.
[4] Data Collection and Production. ILOSTAT. (n.d.). https://ilostat.ilo.org/about/data-collection-and-production/
[5] Datopian. (n.d.). World religion projections. DataHub. https://datahub.io/sagargg/world-religion-projections#resource-by_rounded_percentage_share
[6] Every Country in the World. (2022, November 7). Main religion of every country, every country in the world. Every Country in the World. https://www.everycountryintheworld.com/religions/
[7] GBD results. Institute for Health Metrics and Evaluation. (n.d.). https://vizhub.healthdata.org/gbd-results/?params=gbd-api-2019-public%2Fee5146ecf10e09042077bcd5542b7780
[9] GDP per capita (current US$). World Bank Open Data. (n.d.). https://data.worldbank.org/indicator/NY.GDP.PCAP.CD
[10] How we collect data. Institute for Health Metrics and Evaluation. (2022, November 4). https://www.healthdata.org/data-tools-practices/data-collection
[11] Poverty headcount ratio at $2.15 a day (2017 PPP) (% of population). World Bank Open Data. (n.d.-b). https://data.worldbank.org/indicator/SI.POV.DDAY
[12] Share of people with alcohol use disorders receiving treatment. Our World in Data. (n.d.). https://ourworldindata.org/grapher/share-with-alcohol-use-disorders-receiving-treatment
[13] Unemployment, total (% of total labor force) (modeled ILO estimate). World Bank Open Data. (n.d.). https://data.worldbank.org/indicator/SL.UEM.TOTL.ZS
[14] World Bank Group - International Development, Poverty, & Sustainability. World Bank. (n.d.).https://www.worldbank.org/en/home#:~:text=The%20World%20Bank%20has%20two,prosperity%20in%20a%20sustainable%20way.
[16] World Health Organization. (n.d.). Gho | by category | age limits - alcohol service/sales - by country. World Health Organization. https://apps.who.int/gho/data/node.main.A1144?lang=en
[18] https://www.niaaa.nih.gov/news-events/research-update/deaths-involving-alcohol-increased-during-covid-19-pandemic#:~:text=In%202020%2C%20the%20first%20year,increase%20in%20over%2050%20years.&text=For%20those%20who%20were%20drinking,alcohol%20misuse%20are%20contributing%20factors
[19] https://www.cdc.gov/marijuana/health-effects/addiction.html#:~:text=One%20study%20estimated%20that%20approximately,marijuana%20have%20marijuana%20use%20disorder.&text=Another%20study%20estimated%20that%20people,10%25%20likelihood%20of%20becoming%20addicted
[20] https://ourworldindata.org/alcohol-consumption#alcohol-consumption-vs-income
[21] https://ourworldindata.org/grapher/civil-liberties-rating-fh
Social Data
Religion Data
When thinking about what factors may predict the amount of alcohol use disorders, we predicted that religion would play a role. Following this idea, we searched for a dataset on the major religion of each country. We were unable to find one dataset that had all, or a majority of the countries. To combat this issue, we used the dataset from Every Country in the World which had collected data on 193 countries. We then used datahub.io to fill in the missing countries. The data from EveryCountryInTheWorld was collected in 2023, so we matched the year for the countries pulled from datahub.io. Regarding the specific religions included in the data, the data contained Islam, Christianity, Buddhism, Hindu, Judaism, Shintoism, Muslim, Folk, and None. As some religions only occurred once or twice in the dataset and were therefore not frequent enough to provide enough insight, we ended up combining some religions into one category. The categories became Islam, Christianity, and Other. It is important to note that this dataset only looks at the predominant religion in each country, so it does not look at each individual in the country, so we cannot account for all the variance in the data [5][6].
Alcohol-Type Consumption Data
In trying to predict incidences and deaths resulting from alcohol use disorder, we hypothesized that the quantity of alcohol consumed would be a related factor. OWID led us to the World Health Organization’s (WHO) and Global Health Observatory (GHO), which records data on the average alcohol consumption per capita for countries and territories, measured in liters of pure alcohol per person per year, for three types of alcohol: beer, wine, and spirits [20]. This means less concentrated alcoholic substances, such as beer and wine, follow a conversion chart into liters of pure alcohol (details of the conversion can be found here). To find the average per capita, GHO takes the cumulative sum for each country, divided by its population. Notably, this is unlikely to be an effective measure of the typical observation because alcohol consumption is likely to be skewed, as some people do not drink at all. For this reason, a median would be more accurate, but we believe the mean still provides indicative information about alcohol use disorders, so we chose to consider it in our analysis. Back to the data collection, these metrics come only from measurable alcohol consumption- that of bought, sold, and produced alcohol- and cannot account for at-home, personal brewing/distilling. We used this data as it is, with no mutations or adjustments. Because the data is already population-adjusted by GHO, it won’t simply be higher for larger countries and lower for smaller countries, so the information provided from the variables will be useful for understanding AUD patterns at the individual level within countries.